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Protein kinase recognition 
sequence motifs 



Bruce E. Kemp and Richard 3. Peg: 



Protein kinases play a crucial role in the regulation of many cellular 
processes. They alter the functions of their target proteins by 
phosphorylating specific serine, threonine and tyrosine residues. 
Identification of phosphorylation site sequences and studies with 
corresponding model peptides have provided clues to how these 
important enzymes recognize their substrate proteins. This knowledge has 
made it possible to identify potential sites of phosphorylation in newly 
sequenced proteins as well as to construct specific model substrates and 
inhibitors. 



All protein kinases contain a common 
catalytic domain which typically 
extends over 240 residues 1 , including 
the binding sites for ATP and the pro- 
tein substrate (Fig. 1). The ATP-binding 
site is located at the amino terminus of 
the domain as characterized by the 
Rossmann motif, GXGXXG, while the 
centrally located Aspl84 is responsible 
for base catalysed transfer of the phos- 
phate to the protein substrate 2 . The 
binding site for the protein substrate is 
not unequivocally established, although 



some evidence favours the Idea that it 
is located in the carbon-terminal 60 
residues of the catalytic domain 3 . 

Early studies found that protein 
kinases phosphorylated their target 
proteins at discrete sites. These 
enzymes were shown to prefer certain 
exogenous substrates (such as casein, 
phosvitin and histones) and only phos- 
phorylated a limited number of avail- 
able sites. Indeed, Krebs and Fischer 
showed that phosphorylase kinase only 
phosphorylated SerH in phosphoryi- 
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ase which contains 64 Ser and Thr 
residues. Langan's pioneering work on 
histone phosphorylation demonstrated 
that several protein kinases may phos- 
phoryiate a common substrate at mul- 
tiple sites. The importance of substrate 
specificity was further highlighted by 
the discovery of the cAMMependent 
protein kinase by Walsh and his col- 
leagues. This enzyme did not phosphory- 
late phosphorylase but did phosphory- 
late a wide range of exogenous pro- 
teins. The idea that the multiple func- 
tions of the second messenger cAMP 
were all mediated by the cAMP-depen- 
dent protein kinase implied the enzyme 
recognized a panel of protein sub- 
strates that were subject to hormonal 
regulation. This raised the question of 
how protein kinases recognized specific 
residues out of the numerous hydroxy! 
groups in their substrates. 

It became evident that the local 
sequence around the phosphorylation 
site played a vital role in recognition by 
the cAMP-dependent protein kinase and 
that arginlne residues were involved. 
The cAMP-dependent protein kinase 
phosphorylated 0 casein B at Ser 124 in 
the sequence TERQ5LT (nos 120-126) 
but not in the more common variant p 
casein A 2 where Ser replaces Arg at 
position 122 (Ref. 4). This observation 
suggested that the cAMP-dependent 
protein kinase substrate recognition 
motif may be the RXS* sequence T . 
Studies in several laboratories in the 
mid-1970s demonstrated that the cAMP- 
dependent protein kinase readily phos- 
phorylated short . synthetic peptides 5 
and provided compelling evidence for 
the role of Arg residues. Significantly, 
the synthetic peptide modelled on the 
liver pyruvate kinase phosphorylation 
site sequence, LRRASLG (Kemptide), 
was phosphorylated with kinetic con- 
stants comparable to native protein 
substrates. This data suggests that all 
of the information necessary for recog- 
nition by the cAMP-dependent protein 
kinase could be present in the local 
phosphorylation site sequence and that 
RRXS was a preferred motif (reviewed 
in Ref. 5). While the primary sequence 
and proximity of Arg residues plays an 
important role, higher orders of struc- 
ture can have an overriding influence. 

f The phosphate acceptor site is indicated S* to dis- 
tinguish It from serine phosphate (S(P)} which may 
act as a specificity determinant for some protein 
kinases. Where the specificity determinants are 
known, less essential residues are marked X and 
determinant residues are shown in bold. The com- 
plete phosphorylation site sequence is grven where 
the major determinants are not known (see Table I). 
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Figure 1 

General structure of a protein kinase catalytic domain. Residues conserved in almost all 
known serine, threonine and tyrosine protein kinases are numbered according to the 
cAMP-dependent protein kinase catalytic subunit 



For example, lysozyme does not act as 
a substrate for the cAMP-<iependent 
protein kinase unless it is chemically 
modified. 

Protein kinase specificities 

The study of sequences surrounding 
the local phosphorylation site and the 
phosphorylation of model peptides has 
enabled the identification of phos- 
phorylation site motifs for a number of 
protein kinases (see Table 1). Although 
this is an extensive list, the explosion of 
protein kinase sequences generated by 
DNA cloning has outstripped our knowl- 
edge of their substrate specificities and 
identification of natural substrates* 

All of the calmodulin-dependent pro- 
tein kinases studied to date utilize basic 
residues as specificity determinants. 
The specificity of phosphorylase kinase 
was studied in detail by Graves and his 
colleagues 6 . In the model palindromic 
peptide LSYRRYSL (nos 1-8}, Ser2 is 
phosphorylated by phosphorylase 
kinase, whereas the cAMP-dependent 
protein kinase phosphorylates Ser7 on 
the carboxyl side of the adjacent 
arginines 6 . While there is a strong influ- 
ence of a carboxyl basic residue S*XR 
in the phosphorylase peptide, this is 
not an absolute requirement since the 
enzyme can phosphorylate sites with- 
out a basic residue in this position. The 
non-charged adjacent residues may also 
influence the phosphorylation of model 
peptides. The myosin light-chain ki- 
nases from both smooth and skeletal 
muscle also show restricted substrate 
specificity for myosin light chains. 
While the skeletal muscle isoenzyme 
will phosphorylate light chains from 
skeletal, cardiac and smooth muscle, 
Stull and his colleagues have shown 
that the smooth muscle enzyme has a 
strong preference for light chains from 
the same muscle. The smooth muscle 



myosin light-chain kinase requires the 
sequence KKRXXKXXS*, with the num- 
ber and spatial arrangement of the 
basic residues essential for favourable 
kinetics of phosphorylation and for 
directing the phosphate to the correct 
site 7 . In skeletal muscle myosin light 
chains the local phosphorylation site 
sequence contains Glu at residue 10 in 
the sequence PKKAKRRAAEGSS*NVFS 
(nos 1-17). Synthetic peptide analogs of 
the native sequence are phosphoryl- 
ated with low values, whereas 
those containing Arg at position 10, 
analogous to the smooth muscle light 
chains, are readily phosphorylated by 
skeletal muscle enzyme. This is a good 
example of a negative determinant that 
is apparent from peptide studies but 
presumably not accessible in the intact 
protein. To some extent studying the 
recognition requirements of protein ki- 
nases with highly restricted specificity 
ranges, such as phosphorylase kinase 
and the myosin light-chain kinases, is 
made difficult because of the lack of 
multiple natural phosphorylation site 
sequence for comparisons. This is also 
a problem for the tyrosine kinases. On 
the other hand, the multifunctional 
calmoduIin-<lependent protein kinases 
have broad specificities and recognize 
the motif RXXS*X in both proteins and 
peptide substrates 8 . The specificity 
requirements of the other members 
of the calmodulin-dependent protein 
kinase family are being explored in sev- 
eral laboratories. 

Protein kinase C has been the subject 
of numerous substrate specificity stud- 
ies. The initial studies were carried out 
on brain enzyme which consists of 
multiple isoenzymes. All protein kinase 
C preparations have a requirement for 
basic residues but there can be consid- 
erable variation in the juxtaposition 
and choice of Arg over Lys around the 
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phosphorylation site. Synthetic ^pep- 
tides containing the motif XRXXS XRX 
tend to be the best substrates and 
indeed this arrangement is present in 
the pseudosubstrate autoregulatory 
region of the enzyme (see below). With 
the availability of individual recom- 
binant isoenzymes of protein kinase C, 
Parker and his colleagues are exploring 
the basic residue requirements of the 
individual isoenzymes. 

The specificity and recognition sites 
for the cyclic nucleotide-dependent pro- 
tein kinases have been extensively stud- 
ied (reviewed in Ref. 5). For the cAMP- 
dependent protein kinase, the most typi- 
cal motif is RRXS'X but RXS*X and 
KKXXS*X are also encountered. The 
phosphorylation site sequence RRS* 
occurs in both cardiac troponin and 
hormone-sensitive lipase, but limited 
synthetic peptide studies indicate that 
the Arg adjacent to the Ser(*) is less 
important than the second Arg. There is 
also evidence from synthetic peptide 
studies that more distal Arg residues on 
the amino-tenninal side may have a 
positive influence. Indeed the heat- 
stable inhibitor (Walsh inhibitor) of this 
enzyme has the pseudosubstrate motif 
GRTGRRNA*] with Ala occupying the 
equivalent of the Ser phosphate accep- 
tor site. In scanning sequences for 
cAMP-dependent protein kinase sites 
the search pattern is RXS*, then RRXS* 
or KRXXS*. and not S*XR as the car- 
boxy-terminal basic residue tends to be 
deleterious. The only known examples 
of RXS* motifs where X is not Arg are 
exogenous substrates phosphorylated 
in vitro. The sites phosphorylated in 
vivo by the cAMP-dependent protein 
kinase all contain multiple adjacent Arg 
residues in the arrangements RRXS* or 
RRS\ with two having the latter motif. 
A hydrophobic residue is often, but not 
exclusively, found after the Ser. While 
the yeast cAMP-dependent protein 
kinase specificity differs in several 
respects, the fact that it can be comp- 
lemented by the mammalian enzyme in 
yeast suggests that the differences do 
not override the natural function. 
Recognition site specificity of the 
cGMP-dependent protein kinase has 
been studied in detail by Glass and his 
colleagues 5 . From model peptide stud- 
ies, there is evidence for a requirement 
for an Arg located on the carboxy-termi- 
nal side of the phosphorylated residue 
in the sequence S*R or T*R; however 
this is not an absolute requirement as 
the enzyme has been shown to phos- 
phorylate sites without this motif. The 
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Table I. Protein fdnaM phosphorylation site motifs 



Protein kinase 


Recognition motif 


Refs 


Serfaie snd threonine Idnnu 

Phosphorylase kinase 


KRKQIS'VR 


Chart 8 


Myosin light chain kinase 
[smooth muscle) 

Myosin light chain kinase 
(skeletal muscle) 


XKKRXXRXXS'X 


Kemp 5 


Myosicvi heavy chain kinase 


KXXS'XorRXfX 


Brzeska 10 


C al mod ulin-dependent 
protein kinase 1 


NYLRRLS'DSNF 


Czemik u 


Multifunctional calmodulin- 
dependent protein kinase II 


XRXXS'X 


Pearson® 


CalmoduiirHfependent 
protein kinase III 


RAGET*RFT*DT*RK 


Nairn" 


cAMP-dependent protein kinase 
(mammalian) 
(yeast) 


XRRXS'X 
XRRXS*X 


Zetter ovist 5 


cGMBdependent protein kinase 


XS*RX 


Glass 5 


Protein kinase C (a, P, y) 


XRXXS'XRX 


Graft 13 


56 kinase II 


XRXXS'X 


Erikson 14 


o^RNMependent kinase pp6S 


SELS'RR 


Colthurst 15 


dsDNA<Jependent kinase 


PEET*QT*QOQPMEEEE 


LeesMiller 1 ® 


Protease activated kinase 1 & n 


AKRRRLSS'LRA 


Wettenhall* 


Cell cycle kinase cdc-28, MPF 


XKS*PXorXKT"PX 


Langan 18 


PrcJine4ependent protein kinase 


XS'PXorXT'PX 


vuiiiet 19 


Growth factor regulated kinase 


PIT*PSGEA 


Countaway 20 


Casein kinase 1 


XS(P)XXS*XorXEXXS*X 


Pinna 5 


Casein kinase 11 


XS'XXEX 


Pinna 5 


Mammary gland casein kinase 


XS*XEXorXS"XS(P)X 


Pinna 5 


Glycogen synthase kmase-3 


XS*XXXS(P)X 


Fiol 21 


AMP-acovated protein kinase 
(acetyl CoA carboxylase kinase) 
(HMG-CoA reductase kinase) 
(hormone sensitive lipase kinase) 


MRSSMS'GLHL 

MIHNRS'KINL 

MRRSVS*EAAL 


Hardie 22 



precise requirements are insufficiently 
clear to allow cGMP-dependent protein 
kinase phosphorylation sites to be con- 
fidently identified by scanning amino 
acid sequences alone. 

As early as 1970 Ribadeau-Dumas et 
al. 9 made correct predictions about the 
specificity requirements of the casein 
kinase from the mammary gland Golgi 
apparatus (5*XE or S # XS(P)) based on 
the sequence of the poryphosphoryl- 
ated region in casein. Casein kinase II 
specificity has been studied extensive- 
ly, in particular in the laboratories of 
Pinna and Krebs. This enzyme has a 
very widespread distribution and recog- 
nizes the motifs S*XXE and S*XXS(P). 



Not every site conforming to the motif 
is phosphorylated, casein kinase II 
phosphoryiates Serl7 and not Serl8 in 
the peptide from p casein A 2 , ESLSSSEE 
(nos 14-21). On the other hand, the 
mammary gland enzyme phosphoryi- 
ates Serl8 and not Serl7 in this 
sequence. Casein kinase I and Q have 
been shown to participate in hierarchai 
phosphorylation reactions by several 
groups. Roach and his colleagues found 
that prior phosphorylation of glycogen 
synthase by cAMP-dependent protein 
kinase at Ser7 caused casein kinase I to 
phosphorylate SerlO, whereas phos- 
phorylation of glycogen synthase by 
casein kinase fl provides the recognition 
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Table I. Protein kinase phosphorylation site motifs (continued) 



Protein kinase 



pyruvate dehydrogenase kinase 

Branched chain oketoacid 
dehydrogenase kinase 

Heme regulated elF2a kinase 

Endogenous elF4E kinase 

Hlstone H4 kinase I 

Hfstone H4 kinase II 

Isocrtrate dehydrogenase\ 
kinase (£. coW) 

^Adrenergic receptor kinase 



Rhodopsin kinase 
Tropomyosin kinase 



Tyrosine kinases 

p60" rc 

056^ 

0908**** 
pl20 g*aw 

EOF receptor 
InsuHn receptor 
p75^ 

Spleen tyrosine kinase 



Recognition motif 



Refe 



s'msdpgvs'yrygmgts've 

GHHS*T5D0 and SYRS'VDE 
LSELS'RR 

kndks'ktwq 
vkris'glg 

AcS*GRGKGG 
GIRS*LNVAIR 

GYSVnGNT'GEQS'GWksG 
T'ECHXfeGT^VPS'DNIDS'QtXh 

sTnds'u 

d£as*t*t*vktets*qva 

dnalnd!t5'l-c00h 



RUEDNEY*TARQGAK 

ruedneyTaregak 

peedgery'dedee 

reeadgvy'aasgglr 

RKiEDNEY'TAREGAK 
EEKEY*HA£ 

taenaey*lrvap 
trdiy"etdyYrk 
drvy*vhpf 
edaey*aarrrg 



Edelman 23 

Paxton 24 

Kudiicki 25 

Rychlik 26 

Masaracchia 5 

Masaracchia 5 

Thorsness 27 

Hausdorff 28 
Pataewski 29 
Watson 30 



Geahlen 5 
Geahlen 5 
Geahlen 5 
Geahlen 5 
Geahlen 5 
Geahlen 5 
Geahlen 5 
Geahlen 5 
Geahlen 5 
Geahlen 5 



8 R. E. H. Wettenhail and N. Morrice, unpublished. 



Ser(P) for glycogen synthase kinase-3. 
This enzyme recognizes the motif 
XS*XXXS(F)X. It is of interest that all 
four of the Ser/Thr kinases studied that 
utilize GIu or Asp as specificity determi- 
nants also recognize Ser(P). There are 
also examples where phosphorylation 
at one site by a particular protein 
kinase can suppress the phosphoryl- 
ation of a nearby residue by another pro- 
tein kinase. The nearby phosphoryl- 
ation sites in the hormone sensitive 
lipase, MRRSVSEA (nos 560-567) exhib- 
it this behaviour with mutually exclu- 
sive phosphorylation 17 by the cAMP- 
dependent protein kinase (Ser563) and 
caimodulin-dependent protein kinase II 



(Ser565). ft seems likely that examples 
of hierarchal phosphorylation will 
become more frequent as the specifici- 
ties of additional protein kinase are 
studied. So far hierarchal phosphoryl- 
ation has been observed between nearby 
phosphorylation site sequences, as well 
as over distances of 25 residues in the 
case of the regulatory subunit Rj, phos- 
phorylation by casein kinase n and 
glycogen synthase kinase-3; it is poss- 
ible that even more distant interactions 
involving higher orders of structure 
may occur. 

The recognition sites for a number of 
tyrosine protein kinases have been 
examined. Inspection of the known 



phosphorylation sites indicates that 
acidic residues are often located near 
the tyrosine phosphate acceptor site 
and their importance has been demon- 
strated with model peptides in several 
instances. In general, few natural sub- 
strates (excluding autophosphoryl- 
ation) have been reported for tyrosine 
protein kinases and this has meant that 
we have not had the benefit of compari- 
sons. In summarizing our knowledge of 
tyrosine phosphorylation site motifs, 
Geahlen and Harrison 5 noted: 'Ob- 
servations drawn from the study of syn- 
thetic peptides have frustrated efforts 
to clearly define primary structural 
determinants that are involved in the 
recognition of substrates by tyrosine 
kinases. It is perhaps some consolation 
to investigators of tyrosine kinase sub- 
strate specificity that not all peptides 
containing tyrosine residues are sub- 
strates*. Clearly some fresh approaches 
are required. 

Phosphorylation site motifs 

The progress in recognizing specific 
phosphorylation site motifs for many 
protein kinases has led to the expec- 
tation of being able to scan protein 
sequences and identify phosphoryl- 
ation sites for given protein kinases. 
However, considerable caution is 
required as there are too many excep- 
tions at present to accept the phosphor- 
ylation site motifs listed in Table I as 
'canons' of recognition. We do not know 
precisely which nominal specificity 
determinants actually have correspond- 
ing residues in protein kinase active 
sites and which just favour an optimum 
conformation. Hopefully this dilemma 
will be short lived with the imminent 
solution of the X-ray structure of the 
cAMP-dependent protein kinase com- 
plex with substrate and inhibitor pep- 
tides. Phosphorylation site sequence 
studies such as those made by Cohen 
and his colleagues have been of enor- 
mous value in contributing to our 
knowledge of protein kinase specificity 
and the development of the concepts of 
recognition motifs, but their most 
enduring value is in answering the diffi- 
cult question of which sites are actually 
functionally significant in vivo. 
Structures resembling protein kinase 
phosphorylation site motifs also play 
an important role in the regulation of 
protein kinases. These structures, called 
pseudosubstrate prototopes, are lo- 
cated In the regulatory domains of pro- 
tein kinases and are responsible for 
maintaining protein kinases in inactive 
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forms 3 . The pseudosubstrate sequences 
typically contain an alanine in place of 
the serine or threonine found in the 
phosphorylation site motifs. For protein 
kinase C, the pseudosubstrate sequence 
occurs between residues 19 and 31, 
RFARKGA*LRQKNV, which resembles 
the substrate motif RXXS*XR (Table I) 
with Ala25 in place of the phosphate 
acceptor site. In this case, activation of 
protein kinase C by binding of diacyl- 
ghycerol is thought to induce a confor- 
mational change that removes the pseudo- 
substrate structure from the active site 
allowing access to substrates. One can- 
not fail to be impressed with the way 
nature has utilized the same features 
responsible for substrate recognition to 
regulate these important enzymes. 
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